Statistical sound source identification in a real acoustic environment for robust speech recognition using a microphone array
نویسندگان
چکیده
It is very important for a hands-free speech interface to capture distant talking speech with high quality. A microphone array is an ideal candidate for this purpose. However, this approach requires localizing the target talker. Conventional talker localization methods in multiple sound source environments not only have difficulty localizing the multiple sound sources accurately, but also have difficulty localizing the target talker among known multiple sound source positions. To cope with these problems, we propose a new talker localization method consisting of two algorithms. One algorithm is for multiple sound source localization based on CSP (Cross-power Spectrum Phase) analysis. The other algorithm is for sound source identification among localized multiple sound sources towards talker localization. In this paper, we particularly focus on the latter statistical sound source identification among localized multiple sound sources with statistical speech and environmental sound models based on GMMs (Gaussian Mixture Models) and a microphone array towards talker localization.
منابع مشابه
Blind Source Separation for Speech Application Under Real Acoustic Environment
A hands-free speech recognition system [1] is essential for the realization of an intuitive, unconstrained, and stress-free human-machine interface, where users can talk naturally because they require no microphone in their hands. In this system, however, since noise and reverberation always degrade speech quality, it is difficult to achieve high recognition performance, compared with the case ...
متن کاملCMSC 660 Project Solutions Optimization methods for Sound Source Localization using Microphone arrays
Microphone arrays are widely employed for applications like teleconferencing, high quality sound capture, speaker recognition/identification, acoustic surveillance, head aid devices, speech acquisition in automobile environments etc. For all these applications the benefits that a microphone array provides over a single microphone are two fold. First using a microphone array we can localize a so...
متن کاملSpeech Recognition of Double Talk using S
Double-talk recognition under a distant microphone condition, a serious problem in speech applications in a real environment, is realized through use of modified SAFIA and acoustic model adaptation or training. The original SAFIA is a high-performance audio segregation method based on band selection using two directivity microphones. We have modified SAFIA by adopting array signal processing an...
متن کاملRobust cross-correlation-based methods for sound-source localization and separation using a large-aperture microphone array
of “Robust cross-correlation-based methods for sound-source localization and separation using a large-aperture microphone array,” by Hoang T. Do, Ph.D., Brown University, May 2011 Microphone arrays have been used in many applications, such as: teleconferencing, speech recognition, talker characterization, speech enhancement, source localization and separation, etc. Despite the fast-paced develo...
متن کاملAcoustic Source Localization Based on Beamforming
Sound sources can be localized and analysed using phased microphone arrays.Beamforming is a method for processing microphone array data to produce images that represent the distribution of the acoustic source strength.It is an imaging technique that applies to continuous or discrete source distribution.A microphone array can be designed to be more sensitive to the sound coming from one or more ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001